Detection of Phoneme Boundaries Using Spiking Neurons

نویسندگان

  • Gábor Gosztolya
  • László Tóth
چکیده

Automatic speech recognition (ASR) is an area where the task is to assign the correct phoneme or word sequence to an utterance. The idea behind the ASR segment-based approach is to treat one phoneme as a whole unit in every respect, in contrast with the framebased approach where it is divided into equal-sized, smaller chunks. Doing this has many advantages, but also gives rise to some new problems. One of these is the detection of potential bounds between phones, which has an effect on both the recognition accuracy and the speed of the speech recognition system. In this paper we present three ways of boundary detection: first two simple algorithms are tested, then we will concentrate on our novel method which incorporates a spiking neuron. On examining the test results we find that the latter algorithm indeed proves successful: we were able to speed up the recognition process by 35.72% while also slightly improving the recognition performance.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Unsupervised Phoneme Segmentation of Previously Unseen Languages

In this paper we investigate the automatic detection of phoneme boundaries in audio recordings of an unknown language. This work is motivated by the needs of the project BULB which aims to support linguists in documenting unwritten languages. The automatic phonemic transcription of recordings of the unwritten language is part of this. We cannot use multilingual phoneme recognizers as their phon...

متن کامل

(S)- 3,5-Dihydroxyphenylglycine )an agonist for group I metabotropic glutamate receptors( induced synaptic potentiation at excitatory synapses on fast spiking GABAergic cells in visual cortex

Introduction: (S)- 3,5-Dihydroxyphenylglycine (DHPG) is an agonist for group I metabotropic glutamate receptors. DHPG-induced synaptic depression of excitatory synapses on hippocampal pyramidal neurons is well known model for synaptic plasticity studies. The aim of the present study was to examine the effects of DHPG superfusion on excitatory synapses on pyramidal and fast-spiking GABAergic cel...

متن کامل

Phoneme Boundary Detection using Deep Bidirectional LSTMs

In this paper we investigate the automatic detection of phoneme boundaries in audio recordings with the help of deep bidirectional LSTMs. This work is motivated by the needs of the project BULB which aims to support linguists in documenting unwritten languages. The automatic detection of phoneme boundaries in audio recordings of a new language is part of the technical requirements of the BULB p...

متن کامل

Novel Entropy based moving average

The training of precise speech recognition models depends on accurate segmentation of the phonemes in a training corpus. Segmentation is typically performed using HMMs, but recent speech recognition work suggests that the transient acoustic features characteristic of manner-class phoneme boundaries (landmarks) may be more precisely localized using acoustic classifiers specifically designed for ...

متن کامل

Lexical Plasticity in Early Bilinguals Does Not Alter Phoneme Categories: I. Neurodynamical Modeling

Abstract Sebastián-Gallés et al. [The influence of initial exposure on lexical representation: Comparing early and simultaneous bilinguals. Journal of Memory and Language, 52, 240-255, 2005] contrasted highly proficient early Spanish-Catalan and Catalan-Spanish bilinguals, using Catalan materials in a lexical decision task (LDT). They constructed two types of experimental pseudowords, substitut...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2008